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' ABSTRACT 

We present measurements of the higher-order clustering of red and blue galaxies as 
' a function of scale and luminosity made from the two-degree field galaxy redshift 

I^J \ survey (2dFGRS). We_use a counts-in-cells analysis to estimate the volume averaged 

£^ ■ correlation functions, £ p , as a function of scale up to order p = 5, and also the reduced 

void probability function. Hierarchical amplitudes are constructed using the estimates 
(T) . of the correlation functions: S p = Cp/Cf" • We find that: 1) Red galaxies display 

stronger clustering than blue galaxies at all orders measured. 2) Red galaxies show 
values of S p that are strongly dependent on luminosity, whereas blue galaxies show 
no segregation in S p within the errors; this is remarkable given the segregation in 
the variance. 3) The linear relative bias shows the opposite trend to the hierarchical 
. amplitudes, with little segregation for the red sequence and some segregation for the 

O ^ blue. 4) Faint red galaxies deviate significantly from the "universal" negative binomial 

reduced void probabilities followed by all other galaxy populations. Our results show 
that the characteristic colour of a galaxy population reveals a unique signature in 
its spatial distribution. Such signatures will hopefully further elucidate the physics 
responsible for shaping the cosmological evolution of galaxies. 
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1 INTRODUCTION large numbers of galaxies over a large enough volume to al- 
low robust measurements of clustering to be made for such 
The study or the large scale structure of the Universe is now , , , - T , , r>™r> n , ■ . i 
J , _ , . „ , , , , subsamples (e.g. Norberg et al. 2001, 2002a; Zehavi et al. 
entering a new phasa The two-degree field galaxy redshift Madgwick et al. 2003). 
survey (hereafter 2dFGRS; Colless et al. 2001, 2003) and the ' ' ' 6 ; 
Sloan Digital Sky Survey (SDSS; York et al. 2000; Adelman- There are man y observational clues which point to a 
McCarthy et al. 2006) have yielded high precision measure- dependence of galaxy properties on their local environment, 
ments of the power spectrum of galaxy clustering on large Dressier (1980) argued that galaxies follow a morphology- 
scales (Cole et al. 2005; Tegmark et al. 2006; Percival et al. density relation, with the fraction of early-type galaxies in- 
2007; Padmanabhan et al. 2007) . When these measurements creasing with the local density. Galaxy clusters have well de- 
are combined with high angular resolution maps of the tern- fined red sequences in the colour-magnitude relation (Bower, 
perature fluctuations in the cosmic microwave background Luce y & Ellis 1992 ' Stanford, Eisenhardt & Dickinson 1998). 
radiation (e.g. Hinshaw et al. 2003, 2007), tight constraints Analyses of the 2dFGRS and SDSS data sets have probed 
can be derived on many of the parameters in the cold dark the connection between density and galaxy colour or type 
matter model (e.g. Sanchez et al. 2006; Spergel et al. 2007). over a wider ran S e of environments than was previously pos- 
Within this context, the emphasis in large scale structure sible to reveal a S eneral bimodality in the galaxy population 
studies is shifting to measuring the clustering signal for sam- ( e -S- Lewis et aL 2002 ! Ho SS et aL 2003 ' 2004 5 Balo S n et aL 
pies of galaxies defined by intrinsic properties such as lumi- 2004 )- These results su SS est a correlation between the 
nosity, colour, morphology or spectral type, with the goal of nature of a 8 alax y and its local environment, or equivalent^ 
developing our understanding of the physics of galaxy forma- the mass of its host dark matter hal °- Measures of the clus- 
tion. The SDSS and 2dFGRS catalogues contain sufficiently terin S a mplltude of different populations of galaxies reveal 

different results, indicating that these populations sample 
the underlying mass distribution in different ways. Hence, 

* The Scottish Universities Physics Alliance such clustering measurements can potentially tell us how 
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the efficiency of the galaxy formation process depends upon 
halo mass. 

The 2dFGRS and SDSS allow us to push measurements 
of galaxy clustering beyond the traditional two-point corre- 
lation function or power spectrum. By extending the cluster- 
ing analysis to higher orders, we can extract new information 
about the connection between galaxies and matter. In previ- 
ous work, we employed a counts-in-cells analysis to measure 
the higher-order correlation functions (Baugh et al. 2004; 
Croton et al. 2004b) and the void probability function (Cro- 
ton et al. 2004a) for galaxies samples of different luminosity 
extracted from the 2dFGRS. We found that the higher-order 
correlation functions measured for 2dFGRS galaxies follow a 
hierarchical scaling pattern, characteristic of the clustering 
pattern which results from the growth of initially Gaussian 
fluctuations due to gravitational instability. Croton et al. 
(2004a) obtained independent confirmation of this result 
when they uncovered an unambiguous universal form for 
the void probability function. The values of the correlation 
functions do, however, show differences from the expectation 
for the best fitting cold dark matter model. Gaztafiaga et al. 
(2005) measured the three-point correlation function with- 
out averaging over a cell volume, using triangles of galax- 
ies with sides of varying length ratios. They found the first 
clear evidence for a non-zero second order or nonlinear bias 
parameter, suggesting that the relation of galaxies to the 
underlying dark matter may be more complicated than pre- 
vious analyses had suggested (e.g. Verde et al. 2002; Lahav 
et al. 2002). Measuring the higher-order correlation func- 
tions is a challenging task, even with surveys of the size 
of the 2dFGRS and the SDSS. Baugh et al. (2004) found 
that their measurements of the correlation function for L* 
galaxies in the 2dFGRS were affected on large scales by the 
presence of associations of rich clusters; Nichol et al. (2006) 
found similar effects in the SDSS. 

In this paper, we extend our earlier work by exploiting 
the availability of colour information for the 2dFGRS cata- 
logue. In Section 2 we briefly describe the 2dFGRS galaxy 
catalogue, the counts-in-cells approach we use and the statis- 
tics we measure, along with a reprise of how higher-order 
clustering measurements can be used to make deductions 
about galaxy bias. In Section 3 we present our results for 
the higher-order clustering of 2dFGRS galaxies selected by 
both luminosity and colour. We give a simple interpreta- 
tion of these results in Section 4 and present a summary in 
Section 5. 



2 DATA AND ANALYSIS 

The methodology we use is identical to that described by 
Baugh et al. (2004) and Croton et al. (2004a; 2004b). These 
authors measured the higher-order clustering and the void 
probability function for galaxies as a function of luminosity, 
analysing volume limited samples drawn from the 2dFGRS. 
In this paper we extend this earlier work to consider samples 
defined by galaxy colour in addition to luminosity, as we did 
for the case of the 3-point correlation function in Gaztafiaga 
ct al. (2005). Full details of the clustering measurements and 
a complete discussion of their interpretation can be found 
in the above references; in this section, for completeness, wc 
provide a brief outline of the 2dFGRS (§ 2.1), the statistics 



measured (§ 2.2 and § 2.3) and give a recap of the implica- 
tions of the higher-order clustering statistics for galaxy bias 
(§ 2.4). 



2.1 The 2dFGRS galaxy catalogue 

Our analysis employs the completed 2dFGRS (Colless et al. 
2001; 2003) which contains a total of 221,414 unique, high 
quality galaxy redshifts down to a nominal magnitude limit 
of bj w 19.45 with a median redshift z fs 0.11. In addi- 
tion to fcj-band magnitudes, 7?F-band images have now been 
scanned, allowing a bj — Rf colour to be defined for each 
galaxy. To maximise the volume sampled, we restrict our- 
selves to regions with spectroscopic completeness in excess 
of 50%; however, in practice, the typical completeness is 
much higher than this in the final 2dFGRS, with the mean 
completeness in spheres of radius < 10/i -1 Mpc better than 
~ 80%. The remaining incompleteness is accounted for using 
the volume weighting corrections described in Croton et al. 
(2004b). 

In order to construct a volume limited subsample from 
a flux limited redshift survey, it is necessary to model the 
redshift dependence of a galaxy's luminosity or magnitude 
in the passband in which the survey selection is defined. We 
apply the colour dependent k + e-correction model of Cole 
et al. (2005) to account for the average change in galaxy 
magnitude due to redshifting of the 6j-filter bandpass (the 
"k-correction" ) and also the associated typical galaxy evolu- 
tion (the "e-correction"). The purpose of defining a volume 
limited sample is to isolate galaxies of similar intrinsic lumi- 
nosity in a sample with a simple radial selection function to 
facilitate clustering measurements (see Norberg et al. 2001; 
2002a). Once such a sample has been constructed, we can 
subdivide it by galaxy colour, using the fej and i?p-band 
photometry available for 2dFGRS data. Cole et al. (2005) 
analysed the rest frame bj — Rf colour distribution of galax- 
ies in the 2dFGRS and found a clear division at a rest frame 
colour of bj — rp = 1.07. The bimodality in the colour distri- 
bution about this reference point leads to natural definition 
of "red" and "blue" galaxies. This separation by colour is 
similar, though by no means identical to the selection by 
spectral type employed in previous 2dFGRS analyses (Nor- 
berg et al. 2002a; Madgwick et al. 2002, 2003; Conway et al. 
2005; Croton et al. 2005; see figure 2 of Wild et al. 2005). 

The volume limited samples used in this paper are listed 
in Table 1 (see also table 1 of Croton et al. 2004b for fur- 
ther properties of these samples. Note that the improved 
k + e-correction used here has resulted in small differences to 
the total number of galaxies falling within each volume lim- 
ited boundary when compared with our previous work). Our 
analysis covers a wide range of luminosity, from ~ 0.21^ to 
~ 4 L£ T (the mean effective luminosities of the volume lim- 
ited samples differ by a factor of 6 from the faintest to bright- 
est). The error bars plotted on our measurements in each of 
the figures are jackknife estimates derived by subdividing 
each sample into 20 areas on the sky. Our measurements are 
strongly correlated from bin to bin, so it is essential to take 
this into account when fitting models to the measurements. 
For the purpose of deriving confidence intervals on fitted 
parameters, we compute a full covariance matrix using the 
ensemble of mock 2dFGRS catalogues described by Norberg 
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et al. (2002b). See section 4 for further details on how errors 
from the mocks are used in our analysis. 

2.2 The distribution of counts-in-cells and its 
moments 

The clustering statistics we employ require an accurate 
measurement of the count probability distribution function 
(CPDF) for the red and blue galaxy populations in our vol- 
ume limited samples. This is done using a counts-in-cells 
(CiC) analysis. The CPDF for a given smoothing scale is 
measured by throwing down a large number (2.5 x 10 7 ) of 
spheres of radius R within the survey volume. The proba- 
bility of finding exactly N galaxies within a sphere of this 
scale is given by: 



Pn(R) 



Nn 



(1) 



Here Pn(R) is the CPDF at the given scale R, where TVn is 
the number of spheres containing N galaxies out of a total of 
Nt spheres used. From the CPDF, the clustering moments 
of the distribution can be calculated directly. For example, 
the volume-averaged 2-point correlation function is given by 



UR) = 



]TP N (i?) (iV/iV-l) 2 



1/JV, 



(2) 



where the mean number of galaxies expected in the sphere, 
N, is simply 



Eq. |4] this expansion can be expressed as a function of N& 
only: 



(6) 



Therefore, if the hierarchical assumption is valid, a plot of 
X as a function of N& should produce a universal curve for 
galaxy catalogues with different mean densities and cluster- 
ing properties, assuming common hierarchical amplitudes 
for the different populations (we will test this assumption 
in Section 13 . 2f) . Note that clustered tracers which deviate 
strongly from a Poisson distribution will have VPF values 
X< I- 

The precise form of the reduced VPF is set by the hier- 
archical amplitudes, S p , for which different models of clus- 
tering predict different values. Here we summarise only the 
most successful model, the negative binomial model, which 
does very well at reproducing the measurements from the 
2dFGRS for both the VPF (Fig. 3 in Croton et al. 2004b) 
and the hierarchical amplitudes, S p (Table 1 in Baugh et 
al. 2004) . The reduced VPF and hierarchical amplitudes for 
the negative binomial model are 



X = ln(l + Nb)/N£ 2 
S P = (p-1)! 



(7) 



Further analytic models, like the minimal and thermody- 
namic, are explored in Croton et al. (2004b). 



TV 



NP^(R) 



(3) 



Expressions for the higher-order moments in terms of the 
CPDF are given in Appendix A of Gaztanaga (1994). 

In the hierarchical model of galaxy clustering, all 
higher-order correlations can be expressed in terms of the 
2-point function, £2, and dimensionless scaling coefficients, 
S p (see Bernardeau et al. 2002): 



{ p — Sp 



(4) 



Traditionally, S3 = £3/^2 is referred to as the skewness of 
the distribution and S4 = £4/ £,2 as the kurtosis. Given that 
both £ 2 and £ p (p > 2) can be evaluated using the CPDF, 
the hierarchical amplitudes S p can readily be measured and 
the scaling behaviour of Eq.|3] tested (i.e. the validity of the 
p — 1 power law dependence for higher-order correlations; 
see Baugh et al. 2004). 



2.3 The void probability function 

A complementary way in which to study the hierarchical 
clustering paradigm is through the reduced void probabil- 
ity function (VPF). Put simply, the reduced VPF, x> IS a 
parametrisation of Pq, the probability of finding an empty 
sphere in the galaxy distribution, in terms of the expectation 
from a purely Poisson galaxy distribution: 



X = -ln(Po) / N 



(5) 



White (1979) derived Eq.[5]after writing the void probability 
function as a power series expansion in the moments of the 
CPDF (£ p ) to all orders. Under the hierarchical ansatz of 



2.4 Higher-order moments and relative bias 

The distribution of galaxies could be quite different from 
that of the underlying dark matter. One could conceive of 
physical processes which could lead to a dependence of the 
efficiency of galaxy formation on the mass and perhaps the 
environment of dark matter hales (e.g. Croton, Gao & White 
2007). If this is indeed the case, galaxies should be regarded 
as biased tracers of the matter distribution. 

A simple model for galaxy bias was introduced by Fry & 
Gaztahaga (1993). These authors assumed that the density 
contrast in the galaxy distribution, S G , can be expressed 
as a general non-linear function of the density contrast of 
the dark matter, S DM , so that: S c = F[S DM ]. For density 
fluctuations smoothed on large enough scales so that the 
matter density contrast is of the order unity or smaller, this 
relation can be expanded in a Taylor series: 



c-G °k 1 cDMn/c 

k = 



(8) 



On scales where the variance, £° , is small, the leading 
order contribution to the variance is dominated by the linear 
term: 



?2 



,2 fDM . n rJDMi2 



(9) 



where bi is the so-called "linear bias", b. The leading order 
term in the expansion for the skewness, S3, is: 

S 3 G = ^(S 3 DM + 3 C2 ) , (10) 

where we use the notation C2 = 62/61 (expressions for the 
hierarchical amplitudes up to p — 7 are given in Fry & 



4 Croton et al. 



Table 1. The basic properties of the galaxy samples used in this paper (columns 1-3) and the best fitting relative linear bias b T (Eg. 1111 
column 4) and relative non-linear bias c' 2 (Eg, 1131 column 5). Also shown are the best fitting S p measurements to the hierarchical ratios 
plotted in Fig. [2] (columns 6-8). The three sections give the properties and results for red galaxies, blue galaxies, and all colours combined 
(included to facilitate comparison with earlier work). All errors correspond to the 95 % confidence interval (2-sigma) for fits carried out 
using measurements in the range R = 4.5 — 14/i _1 Mpc. 



population 


Mag. range 
M bj -5\o Sl0 h 




(Eq.HTJ 
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(Eq.EJ 


s 3 


Si 




RED GALAXIES 


-18.0 -19.0 
-19.0 -20.0 
-20.0 -21.0 


7710 
18693 
15147 


0.97 ±0.06 
0.89 ± 0.04 

1 


0.25 ±0.15 
0.13 ±0.16 



2.83 ±0.37 
2.69 ±0.34 
2.04 ±0.15 


13.5 ±4.5 
12.2 ±4.3 
6.4 ±2.4 


80 ±50 
74 ±50 
28 ± 15 



BLUE GALAXIES -18.0 -19.0 14086 0.84 ± 0.05 -0.11 ±0.09 2.00 ±0.13 5.35 ± 1.78 11.0 ±6.5 

-19.0 -20.0 22499 0.90 ± 0.04 -0.06 ±0.10 2.09 ±0.14 6.66 ± 1.47 29 ± 23 
-20.0 -21.0 15125 1 1.83 ±0.13 4.28 ±0.94 10.0 ± 5.5 



RED±BLUE GALAXIES -18.0 -19.0 21796 0.85 ± 0.06 0.02 ±0.12 2.44 ± 0.32 9.46 ±3.15 46 ± 33 
-19.0 -20.0 41192 0.88 ±0.04 0.00 ±0.11 2.35 ± 0.30 8.77 ± 3.27 44 ± 36 
-20.0 -21.0 30272 1 2.01 ±0.14 5.97 ±1.31 24 ± 13 



Gaztahaga 1993). S 3 encodes the non- linear gravitational 
evolution of the matter distribution from Gaussian initial 
conditions. Hence, two non-linear effects are present in S®: 
gravity (i.e. as contained in Sjp^ 1 ) and galaxy bias (i.e. as 
quantified by C2). Both terms are modulated by the linear 
bias, b\. Therefore, in general, the interpretation of the skew- 
ness measured in the galaxy distribution is not trivial. How- 
ever, on weakly non-linear scales we have detailed models of 
what to expect for the skewness of the dark matter (see, for 
example, Bernardeau et al. 2002; calculations have also been 
carried out for non-standard gravitational models and cos- 
mologies - see Gaztariaga & Lobo 2001). We can therefore 
hope to learn about non-linear galaxy biasing if we can mea- 
sure the skewness in the galaxy distribution, 5*3", on scales 
on which the underlying fluctuations are only weakly non- 
linear. 

It is useful to define a relative bias to aid with the in- 
terpretation of measurements of higher-order clustering. The 
relative bias describes the change in clustering signal com- 
pared to that measured for a reference sample. Using Eq. O 
as a guide, we define the relative linear bias of a sample, 
b T = bi/b*, as the square root of the ratio of the 2-point 
correlation function (or variance) measured for the sample, 
relative to that found for the reference sample, denoted by 
an asterisk: 




Our reference sample is the volume limited population of 
galaxies in the magnitude range — 20 > M;,, — 5 log 10 h > 
—21, which is one magnitude brighter than the one used in 
Croton et al. (2004a). The reason for choosing a brighter 
sample is directly related to the influence the large coherent 
superstructures have on the L* sample: it makes more sense 
to use as reference a sample which is not systematically af- 
fected by the presence of such rare massive structure. For 
completeness, in Table 1 we show the relative bias results for 
the volume limited "all colour" catalogues analysed in Cro- 



ton et al. (2004a), however now presented relative to this 
brighter reference sample. 

On scales for which a linear bias is a good approxi- 
mation, i.e. when bk — for k > 1, we can relate the Sp 
measured for different galaxy samples regardless of the value 
of S° M for the underlying mass: 

S? = p (12) 

More generally, if galaxy bias is non-linear one can introduce 
a measure to quantify the relative non- linear bias: 

/ (C2 — Cj) 1 / G c,G»\ / 1Q \ 

c 2 = jp, = 3 {0tS 3 - S 3 J , (13) 

where an asterisk denotes a quantity measured for the refer- 
ence sample. In general, if the reference sample is un-biased 
(i.e. b\ — 1 and c* =0), we then have c' 2 = C2 and b r — bi 
for all samples. 



3 RESULTS 

In this section we present the main results of the paper, 
which are inferred from our measurement of the count prob- 
ability distribution function (CPDF), as outlined in Section 
2, for the samples of red and blue galaxies listed in Table 1. 
We present the volume averaged correlation functions, £ p , 
in § 3.1, the hierarchical amplitudes, S p , in § 3.2, and the 
reduced void probability function (VPF) in § 3.3. 

3.1 The higher-order correlation functions 

Fig. [T] shows the volume averaged correlation functions for 
orders p = 2-5 as a function of smoothing scale. The left 
hand panel shows the measurements for red galaxies and 
the right panel for blue galaxies. In both panels the symbols 
show the results for samples in the magnitude range — 19 > 
Alt-, — 51og 10 /i > —20 (i.e. approximately L^j), while the 
adjacent dashed and dotted lines show samples which are 
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Figure 1. The p-point volume averaged correlation functions as a function of scale. The left and right panels show results for red and 
blue galaxies separately. In each panel, the symbols denote galaxies in the magnitude range —19 > — 51og 1( )/i > —20, while the 

adjacent dashed and dotted lines show the clustering measured for galaxies defined by magnitude ranges which are, respectively, one 
magnitude brighter and fainter (Table [TJ. Different symbols show different orders of clustering, as indicated by the key. 



one magnitude brighter and fainter respectively. The order 
of volume averaged correlation function is indicated by the 
symbol type shown in the legend. 

Red galaxies clearly show a larger clustering amplitude 
than the equivalent population of blue galaxies, as expected 
from earlier results obtained for the two-point correlation 
function as a function of colour (e.g. Zehavi et al. 2005 in 
the SDSS) or spectral type (Norberg et al. 2002a; Madgwick 
et al. 2003 in the 2dFGRS). The dependence of clustering 
strength on galaxy luminosity appears to be the strongest 
for the blue population, with the faintest blue galaxies be- 
ing more weakly clustered than the brightest. The situation 
is more complicated for red galaxies; both the faintest and 
brightest samples of red galaxies appear to be more strongly 
clustered than the red L* sample. There is even a sugges- 
tion that the faintest red galaxies become the more strongly 
clustered with increasing order, although the errors on the 
measurements are also increasing at a fixed scale for higher 
values of p. This type of behaviour was noted previously for 
the two-point correlation function. Norberg et al. (2002b) 
found an increase in the correlation length of early spectral 
types for samples faintwards of L* , with the faintest early 
types displaying a similar correlation length to the brightest 
early types. Zehavi et al. (2005) found a similar increase in 
clustering strength for faint red galaxies. This trend can be 
readily explained; the satellite population in massive clus- 
ters is predominantly made up of faint, red galaxies and 
clusters are known to be strongly biased tracers of the mass 
distribution (e.g. Padilla et al. 2004). 

3.2 The hierarchical amplitudes 

Fig. [2] extends the results presented in the previous subsec- 
tion by plotting the hierarchical amplitudes (as defined by 



Eq.|4}, S3 (the skewness), S4 (the kurtosis) and S5, as a func- 
tion of scale. This is done for red galaxies in the left panel 
and blue galaxies in the right panel. Again, the results are 
shown for samples with different luminosities, with symbols 
representing the measurement of galaxies in the magnitude 
range —19 > Mb, — 51og 10 h > —20, and dashed and dotted 
lines showing the 5* p for samples one magnitude brighter and 
fainter than this respectively. Red galaxies show a strong 
segregation of the hierarchical amplitudes with luminosity 
at a given clustering order; the S p for blue galaxies show 
little dependence on luminosity. 

On large scales, N-body simulations and perturbation 
theory suggest that the hierarchical amplitudes for the mass 
should only vary slowly with scale in the case of initially 
Gaussian fluctuations which grow through gravitational in- 
stability (e.g. Juskiewicz, Bouchet & Colombi 1993; Baugh 
et al. 1995; for a summary of the theoretical predictions, 
see Bernardeau et al. 2002). Observationally, the hierarchi- 
cal amplitudes for galaxies have traditionally been measured 
to be either approximately constant or decreasing slowly 
out to progressively larger smoothing scales (e.g. Gaztanaga 
1992; Bouchet et al. 1993; Gaztanaga 1994; Hoyle, Sza- 
pudi & Baugh 2000; Frith, Outram & Shanks 2006; see 
Bernardeau et al. 2002 for a review of previous measure- 
ments). However, recent work has shown that the presence 
of large, rare superstructures in the galaxy distribution can 
produce a significant upturn in the value of S p on large 
scales (e.g. Szapudi & Gaztanaga 1998; Baugh et al. 2004; 
Croton et al. 2004b; Nichol et al. 2006). According to figure 
10 from Gaztanaga et al. (2005), the upturn in the hier- 
archical amplitudes appears to be equally important when 
using measurements of the 3-point correlations function of 
red and blue galaxies from the 2dFGRS -19 > M bj > -20 
sample. Nevertheless, it should be emphasised that, in the 



6 Croton et al. 



2.0 



o 



o 



.5 



O 1 .0 



0.5 



0.0 




0.0 0.5 1.0 

log (R/h _1 Mpc) 



0.0 0.5 1.0 1.5 

log (R/h" 1 Mpc) 



Figure 2. The hierarchical amplitudes, S p , plotted as a function of scale. The left panel shows the results for red galaxies and the right 
panel those obtained for blue galaxies. The symbols and lines have the same meaning as those used in Fig. [TJ The grey shaded regions 
show how the results for the L* galaxy sample change when the two largest superstructures are excluded from the CiC analysis, as 
described in the text. 



case of the 2dFGRS, the impact of the superstructures on 
the measured hierarchical amplitudes is most pronounced 
in the —19 > Ah, — 51og 10 h > —20 volume limited sample 
and is essentially negligible in the brighter and fainter sam- 
ples. This is due to the superstructures falling entirely within 
the redshift interval spanned by the L* sample. In the case 
of the fainter volume limited sample, the superstructures 
lie mostly beyond the maximum redshift which defines the 
sample, whilst in the case of the brighter sample, the volume 
covered is much larger and the significance of the superstruc- 
tures is correspondingly lower. See Croton et al. (2004b), 
Baugh et al. (2004) and Gaztanaga et al. (2005) for detailed 
investigations into how these two superstructures affect the 
measurement of higher-order statistics. The flatness of the 
earlier results for the 5 P can be understood by noting that 
a) most smaller surveys never intersected with these rare 
superstructures of galaxies, b) their influence for larger sur- 
veys, such as the original 2dFGRS parent catalogue, the 
APM Survey (i.e. Gaztanaga 1994) or 2MASS (Frith, Out- 
ram & Shanks 2006), was diluted by the significant increase 
in sample volume, and c) typically any contribution from 
large coherent superstructures is downgraded in a flux lim- 
ited catalogue rather than a volume limited sample due to 
the varying selection function. 

In Fig. [2] the grey shaded regions show the results for 
the hierarchical amplitudes measured from the Lj* T sample 
after removing the two large superstructures identified in 
Baugh et al. (2004). The shaded regions indicate the extent 
of the 1—a errors on the measurements and should be com- 
pared with the symbols and error bars, which show the re- 
sults for the same volume limited sample but without remov- 
ing the superstructures from the CiC analysis. By omitting 
the two superstructures, the theoretically expected constant 
behaviour of 53 is restored for larger smoothing lengths. 



However, whilst the results change on large scales for 54 and 
55 on removing the superstructures, there is still significant 
scale dependence of the 5 P with opposite gradients found 
for red and blue galaxies. From Fig. [2] the impression is 
that the red population is the most strongly affected by the 
presence of the superstructure^]. This behaviour should be 
contrasted with the measurements by Croton et al. (2004b) 
and Baugh et al. (2004) made using the full L£ 7 sample. 
In these papers, removing the superstructures from the CiC 
analysis, led, to within the accuracy of the measurements, 
to all the hierarchical moments up to 5 th order becoming 
roughly independent of scale. Our results for red and blue 
galaxies imply that the L* results for all galaxies were due 
to a fortuitous cancellation of the trends with scale seen for 
red and blue galaxies. 

Finally, it is worth noting that the upturn seen on small 
scales in both 54 and 5s for the bright and faint populations 
of blue galaxies is not statistically significant: in both cases, 
the jackknife errors, if plotted, are fully consistent with no 
upturn at all. 



3.3 The reduced void probability function 

In Fig. we plot the reduced void probability function 
(VPF) for the three luminosity samples (using different sym- 
bols as shown by the legend) and for red and blue galaxy 
populations (left and right hand panels respectively). Pre- 
vious measurements have revealed that galaxy samples de- 
fined by luminosity (or equivalently number density) dis- 
play a universal form for the VPF (e.g. Croton et al. 2004a; 

1 A robust quantitative statement would require the use of more 
realistic mocks, for which colour dependent absolute errors on S p 
can be derived. 
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Figure 3. The reduced void probability function plotted as a function of the scaling variable 7V£. Left and right panels show the 
measurements for the red and blue populations respectively, while the symbols denote galaxy samples of different luminosity as indicated 
by the legend. The grey shaded region shows how the results change for the L* sample when the two largest superstructures are removed 
from the analysis. 



Hoyle & Vogeley 2004; Patiri et al. 2006). This universality 
also holds for samples at different redshifts (Conroy et al. 
2005; 2007). The recovery of a universal form for the VPF is 
evidence for the hierarchical scaling of the higher-order cor- 
relation functions of galaxies (e.g. Eq[4}. The actual form 
of the VPF curve is well described, in redshift space, by the 
negative binomial model (see also Fry 1986 and Gaztanaga 
& Yokoyama 1993). 

Here we report a significant departure from the univer- 
sal scaling of the VPF. As can be seen in Fig. [3] the VPF 
measured for red galaxies deviates from the universal scaling 
in a way such that the deviation is strongest for the faintest 
galaxies considered. The VPF measured for blue galaxies, 
on the other hand, shows no dependence on luminosity, and 
matches almost perfectly the negative binomial model across 
the entire range plotted. Note that the deviation seen for the 
red L% population appears to arise from the contribution 
of the two superstructures discussed in the previous sub- 
section. Once these large structures are removed from the 
analysis and the VPF is recalculated, the results for the Lt 
sample shift back to the negative binomial model, as shown 
by the grey shaded region. On the other hand, the fainter 
red galaxy sample (shown by the triangles in Fig. [3} , which 
is not affected by the two superstructures, does not agree 
with the negative binomial model. This disagreement per- 
sists if we consider even fainter samples, which we have not 
plotted in the figure for clarity. 

Our results show that once we have accounted for the 
difference in mean density and clustering, faint red galaxies 
have a characteristically larger VPF than is measured for the 
other galaxy samples. From Eq. [6] we expect that samples 
with a larger VPF should also also have larger values for 
the S p , and this is borne out by the results given in Table 1, 
which are discussed in the next section. A high value for S p 



indicates non-linear biasing. If one were to simply increase 
the linear bias, this would increase the amplitude of the 
correlation functions, £ p , but would reduce the hierarchical 
amplitudes (see Eq. [4] and Eg. I12[) . A non-zero non- linear 
bias is characteristic of the high density regions of the dark 
matter distribution. This is in line with the suggestion made 
above that the red faint galaxies could be made up of a 
significant population of satellites in clusters. 

The reasons behind the success of the negative binomial 
model for the VPF are unclear. Vogeley et al. (1994) showed 
that the VPF measured in their simulations only agreed with 
the negative binomial model when the clustering pattern 
was distorted by peculiar motions; the VPF measured in 
real space did not match the negative binomial model. How- 
ever, our results imply that the peculiar motions of galaxies 
are not the primary agent responsible for the success of the 
negative binomial model. We find that the faint red galax- 
ies, which are primarily satellites within galaxy clusters and 
hence likely to display large peculiar motions, in fact show 
the strongest departure from the VPF predicted by the neg- 
ative binomial model, rather than being driven towards this 
model. 



4 DISCUSSION 

In this section we give some simple interpretations of the 
measurements of the hierarchical amplitudes reported in 
Section 13.21 This requires us to perform fits to the mea- 
surements, in the first case by extracting best fitting values 
for the hierarchical amplitudes, S p (as defined in Eq.2}, and 
then to constrain the linear and non-linear bias parameters 
(see Eq. [11] and Eq. [13} . 

Measures of the hierarchical amplitudes on different 
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smoothing scales are correlated, a fact which has often been 
ignored in previous analyses reported in the literature. The 
existence of a bin-to-bin correlation is clearly demonstrated 
by the response of our estimates of the hierarchical ampli- 
tudes to the removal of the two largest superstructures in the 
2dFGRS. Given this correlation, it is necessary to construct 
a covariance matrix for the measurements in order to carry 
out meaningful fits. One way to do this is to use a jackknife 
approach in which the survey volume is split into a number 
of equal sized subvolumes, on the order of 20 (if not more to 
construct a stable covariance matrix). Unfortunately, this is 
infeasible for our fainter volume limited samples, as the vol- 
ume covered in these cases is too small to be subdivided an 
appropriate number of times. Instead, we use the ensemble 
of mock 2dFGRS catalogues whose construction is described 
in Norberg et al. (2002b). Particles were chosen from an N- 
body simulation to represent galaxies using a simple para- 
metric function of the local smoothed density of dark matter 
(see Cole et al. 1998 for a description of the bias algorithm) . 
The parameters in this empirical biasing prescription were 
constrained to reproduce the typical clustering measured in 
the full flux limited 2dFGRS (Hawkins et al. 2003). Conse- 
quently, one limitation of the mock catalogues is that the lu- 
minosity and colour assigned to a "galaxy" is independent of 
the local density of the particle. This means that the mocks 
do not display any dependence of clustering on luminosity 
or colour. Also, no information about the higher-order clus- 
tering of mock galaxies was used to select the parameters 
in the bias prescription. Hence, although the two point cor- 
relation function for mock galaxies agrees very closely with 
that measured for the full 2dFGRS, there is no guarantee 
that the higher-order clustering in the mocks will look like 
that measured in the real survey. 

As we found in our earlier work using the mocks (Croton 
et al. 2004b; Gaztafiaga et al. 2005), the covariance matrices 
estimated from the mocks are stable and give robust esti- 
mates of best fitting quantities and the associated errors. A 
principal component decomposition of the covariance matrix 
reveals that the first few eigenvectors are typically responsi- 
ble for the bulk of the variance or signal. We rank the eigen- 
vectors in order of decreasing variance and retain sufficient 
eigenvectors to account for, at least, 95% of the variance. 

We first perform fits to extract values for the scale inde- 
pendent hierarchical amplitudes. Although our results may 
appear in some cases to show a strong dependence on cell 
radius (Fig. [5J), it is important to bear in mind that we can 
only show the diagonal part of the covariance matrix in plots 
and our fitting procedure takes into account the covariance 
between bins. Moreover, since there is a tradition of fitting 
constants to estimates of the S p over a range of scales, it is 
useful to repeat such an analysis for our measurements in or- 
der to facilitate comparisons with previous work. We restrict 
our attention to the S p values obtained for spheres with radii 
in the range R = 4.5 — 14/i _1 Mpc. These are slightly larger 
scales than we used in Croton et al. (2004b), since we aim 
to avoid scales which may be affected by shot noise, and our 
samples, as a result of being split by colour, typically have 
half the number of galaxies in each luminosity bin than be- 
fore. To help with a comparison to our earlier work we also 
re-analyse the "all colour" galaxy catalogues across this new 
fitting range (note that these galaxies also differ in their im- 
proved k + e corrections, as described in Section [2.1[> . The 



results of a one parameter fit (S p ) to the hierarchical ampli- 
tudes are quoted as a function of luminosity for all galaxy 
colours and also the red and blue populations separately in 
Table [1] in which we also give the 95 % confidence interval 
around the best fitting value. 

Next we constrain the relative bias parameters defined 
in Eq. [11] and Eq. 1131 Recall that in this paper we define 
our reference sample to be volume limited galaxies in the 
magnitude range —20 > Mt, — 51og 10 /i > —21. The error 
in the relative linear bias is straightforward to find using 
the mocks and the results are given in Table 1. Obtaining 
a robust fit for the nonlinear bias, c 2 , is more subtle. By 
construction, the mocks will return a zero nonlinear biaijj, 
whereas the expectation value for 2dFGRS samples can be 
anything. Therefore, in order to get a reliable estimate of the 
error on the best fitting nonlinear bias using the mocks, we 
need to take an indirect approach. We fit instead the com- 
bination brSs , for which the mock catalogues are in rather 
good agreement with the real survey. Taking the best fitting 
value for the quantity b r S§ , and the skewness measured for 
the reference sample, we can obtain a best fitting value for 
the nonlinear bias, c 2 , using Eq. 1131 The error on the best 
fitting value is obtained by adding the errors on fcrSf and 
the skewness of the reference sample in quadrature, ie. as- 
suming brSs and S3 3 * are uncorrelated. The resulting values 
of the nonlinear bias and the error on the best fit are listed 
in Table 1. 

The results in Table 1 reveal substantial differences in 
the hierarchical amplitudes obtained for red and blue galax- 
ies and in the way in which these amplitudes change with lu- 
minosity. Overall, we find that blue galaxies display smaller 
hierarchical amplitudes than red galaxies. Red galaxies show 
a significant change in S p with luminosity, whereas blue 
galaxies show no such trend. Assuming Gaussian errors, we 
find a 3-ct shift in the best fitting S3 between the faintest 
and brightest samples of red galaxies. In view of the rela- 
tively small baseline in luminosity over which we can per- 
form such a fit, a factor of six in median luminosity moving 
from the faintest to the brightest volume limited sample, it 
is remarkable to see such a clear change in the higher-order 
clustering. These results are in good agreement with those 
found in Croton et al. (2004b) (see their Fig. 10). It is also 
noteworthy that the relative bias fo r quoted in Table 1, shows 
the opposite tendency to that displayed by the hierarchical 
amplitudes: there is little dependence of the linear relative 
bias on luminosity for red galaxies and a clear trend for blue 
galaxies. 

Finally, it is encouraging to note that the conclusions 
we draw from our counts-in-cells analysis are consistent with 
those we reported in Gaztafiaga et al. (2005, SAGS-II), who 
measured the 3-point correlation function for different trian- 
gle shapes and sizes (i.e. without averaging over the volume 
of a cell). In particular Fig. 7 in SAGS-II shows how for both 
equilateral and elongated triangles, late galaxies show little 
luminosity segregation in Q3, while early galaxies show a 
strong segregation, with Q3 increasing with decreasing lu- 
minosity (see also Nishimichi et al. 2007). 



2 Without luminosity segregation, the expectation value for the 
mocks is c 2 ~ 0. 
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5 SUMMARY 

This paper complements and extends previous clustering 
analyses of the 2dFGRS, taking advantage of the availabil- 
ity of _Rp-band photometry to study the spatial distribution 
of galaxies as a function of both their colour and luminos- 
ity. We use a counts-in-cells approach to estimate the count 
probability distribution function (CPDF) of red and blue 
galaxies in different luminosity bins, spanning a factor of six 
in luminosity around L* . From the CPDF, we estimate the 
volume-averaged higher-order correlation functions, £ p , the 
hierarchical amplitudes, S p , and the reduced void probabil- 
ity function for our galaxy samples, %■ We use our measure- 
ments of the hierarchical amplitudes to constrain the linear 
and nonlinear bias parameters in a simple model of galaxy 
clustering. 

The main results of this paper can be summarised as 
follows: 

• Colour segregation in £ p : Blue galaxies show signifi- 
cantly lower clustering amplitudes than red galaxies. This 
trend holds up to the five point correlation function, ex- 
tending the results previously reported for two-point cor- 
relations (e.g. Norberg et al. 2002a; Madgwick et al. 2003; 
Zehavi et al. 2005). 

• Luminosity segregation in £ p : Blue galaxies display a 
monotonic increase of clustering strength with luminosity, 
mirroring the results found for the two-point function of 
2dFGRS galaxies (Norberg et al. 2002a). The behaviour of 
the red population is more complicated. Faint and bright 
red galaxies are more clustered than L* red galaxies. A hint 
of this behaviour was previously found for the two-point 
correlation function of early spectral types in the 2dFGRS 
(which loosely correspond to the red sample in this paper) 
by Norberg et al. (2002a). 

• Luminosity segregation in S p : We fit scale-independent 
models to the hierarchical amplitudes which take into ac- 
count the full covariance between adjacent bins. From this 
we conclude that the hierarchical amplitudes of blue galax- 
ies show little or no dependence on galaxy luminosity. On 
the other hand, the hierarchical amplitudes of red galaxies 
vary strongly with luminosity. These new results, split by 
colour, explain why Croton et al. (2004b) found only a weak 
dependence of the S p on luminosity. 

• Superstructures in the 2dFGRS. The hierarchical ampli- 
tudes for the red galaxies seems to be more affected by the 
presence of the superstructures. After removing them, there 
is still significant scale dependence of the S p , with opposite 
gradients found for red and blue galaxies. These results im- 
ply that the flatness of the L* S p measurements of Croton 
et al. (2004b) and Baugh et al. (2004), after removing the 
superstructures, are due to a fortuitous cancellation of the 
different trends with scale for red and blue galaxies. 

• The reduced void probability function. Croton et al. 
(2004a) found that the reduced void probability function 
measured for 2dFGRS samples denned by luminosity dis- 
played a universal form, and matches the one predicted by 
a negative binomial distribution. Splitting the samples by 
colour, we find that blue galaxies show the universal reduced 
void probability function consistent with the negative bino- 
mial, but red galaxies do not. The deviation from the neg- 
ative binomial model is largest for faint red galaxies. This 
result is seemingly at odds with previous interpretations of 



the success of the negative binomial model, in which it was 
suggested that galaxy peculiar motions were the primary 
agent behind the form of the reduced void probability func- 
tion. 

• Linear and non-linear bias. Fry & Gaztanaga (f993) in- 
troduced a simple model of galaxy bias in which the density 
contrast in galaxies is written as a Taylor expansion of the 
density contrast in the underlying mass. The first order bias 
term in this expansion is the common linear bias and the sec- 
ond order term is called the quadratic or nonlinear bias. We 
use our measurements of the variance and the skewness (S p ) 
to extract the linear and nonlinear bias parameters relative 
to the clustering in a reference sample. The reference sam- 
ple is the one for which we are able to make our best mea- 
surements of galaxy clustering. In this paper, galaxies with 
magnitudes in the range —21 < M bj — 51og 10 h < —20 are 
treated as the reference sample; there is a reference sample 
for red galaxies and one for blue galaxies. The relative bias 
parameters extracted for red and blue galaxies are different. 
The faintest red galaxies we consider have a linear bias con- 
sistent with that of the reference sample; red L* galaxies 
have a linear bias below unity. For blue galaxies, the linear 
bias increases with luminosity. The relative non-linear bias 
for red galaxies is positive whereas that extracted for blue 
galaxies is negative. In all cases, these offsets are significant 
at the 1 — 2a level. 

Our measurements of the higher-order clustering of 
galaxies as a function of colour and luminosity provide valu- 
able new constraints on models of galaxy formation. While 
the colour of a galaxy is determined by its star formation his- 
tory, the clustering of galaxies, at least on the scales probed 
in this paper, is driven primarily by the mass of the host dark 
matter halo and to a lesser extent by the formation history of 
the halo (Gao et al. 2005; Harker et al. 2006; Wechsler et al. 
2006; Croton, Gao & White 2007). Semi-analytical models of 
galaxy formation make ab initio predictions for the star for- 
mation histories of galaxies in a cosmological setting (see the 
review by Baugh 2006). This is done through simple physical 
prescriptions that describe those aspects of galaxy evolution 
believed to be important, including the rate of gas cooling, 
the timescale for star formation in galactic disks, mergers 
between galaxies, and feedback processes such as heating by 
supernova explosions or the accretion of material onto super- 
massive black holes. These processes are poorly understood 
and consequently, there is no unique way in which to model 
them. 

At present there are a number of galaxy formation mod- 
els which follow the evolution of disks and spheroids from 
high redshift to the present day (e.g. Baugh et al. 2005; 
Croton et al. 2006; Bower et al. 2006; Cattaneo et al. 2006; 
De Lucia & Blaizot 2007). The parameters that constrain 
the simple physics assumed in these models are typically set 
without reference to galaxy clustering. Higher-order clus- 
tering and non-linear bias measures are sensitive to how 
galaxies populate groups and clusters, and their periphery. 
We therefore look towards such statistics to further con- 
strain and discriminate between different possible (and plau- 
sible) implementations of the galaxy formation physics. The 
higher-order spatial distribution of galaxies provides an ad- 
ditional window through which we can hope to understand 
the complex physics governing galaxy evolution. 
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